Investigating the Validity of a Test Case Selection Methodology for Expert System Validation
نویسندگان
چکیده
Providing assurances of performance is an important aspect of successful development and commercialization of expert systems. However, this can only be done if the quality of the system can be assured through a rigorous and effective validation process. However, a generally accepted validation technique that can, if implemented properly, lead to a determination of validity (a validity statement) has been an elusive goal. This has led to a generally haphazard way of validating expert systems. Validation has traditionally been mostly done through the use of test cases. A set of test cases, whose solution is previously known and benchmarked, is presented to the expert system. A comparison of the system’s solutions to that of the test cases is then used to somehow generate a validity statement. It is an intuitive way of testing the performance of any system, but it does require some consideration as to how extensively to test the system in order to develop a reliable validity statement. One completely reliable statement of a system’s validity could result from exhaustive testing of the system. However, that is commonly considered to be impractical for all but the most trivial of systems. A better means to select “good” test cases must be developed. The authors have developed a framework for such a system (Abel, Knauf and Gonzalez 1996). This paper describes an investigation undertaken to evaluate the effectiveness of this framework by validating a small but robust expert system to classify birds using this framework.
منابع مشابه
Towards an Assessment of an AI System ' s Validity by a
Although there seems to be no (formal) way of proving the validity of an AI system, the authors present some ideas on developing a validity statement based on a Turing-test methodology with a set of "good" 1 test cases. The solution of these test cases will be rated by a panel of expert validators. The methodology is called the Turing-test, because a random process of distributing the test case...
متن کاملA framework for validation of rule-based systems
We describe a complete methodology for the validation of rule-based expert systems. This methodology is presented as a five-step process that has two central themes: 1) to create a minimal set of test inputs that adequately cover the domain represented in the knowledge base; and 2) a Turing Test-like methodology that evaluates the system's responses to the test inputs and compares them to the r...
متن کاملA TURING Test Approach to Intelligent
The authors present some ideas on developing a validity statement based on a Turing test { like methodology with a set of test cases. The (anonymous) solutions of these test cases will be rated by a panel of expert validators. The methodology is called the Turing test, because a random process of distributing the test case with solutions to the diierent validators ensures that no validator know...
متن کاملDevelopment of a QFD-based expert system for CNC turning centre selection
Computer numerical control (CNC) machine tools are automated devices capable of generating complicated and intricate product shapes in shorter time. Selection of the best CNC machine tool is a critical, complex and time-consuming task due to availability of a wide range of alternatives and conflicting nature of several evaluation criteria. Although, the past researchers had attempted to select ...
متن کاملTowards an Assessment of an Ai System's Validity by a Turing Test
The authors present some ideas on developing a validity statement based on a Turing-test methodology with a set of "good" test cases. The objective of this is, of course, to make the result of the validation process (the validity statement) more objective. Furthermore, in an eeort to maximize objectivity, the approach described here includes a competence scale for each validator. This is done f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998